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Abstract 

We establish a general formula for the distribution of the score in table tennis. We 
use this formula to derive the probability distribution (and hence the expectation and 
variance) of the number of rallies necessary to achieve any given score. We use these 
findings to investigate the dependence of these quantities on the different parameters 
involved (number of points needed to win a set, number of consecutive serves, etc.), with 
particular focus on the rule change imposed in 2001 by the International Table Tennis 
Federation (ITTF). Finally we briefly indicate how our results can lead to more efficient 
estimation techniques of individual players' abilities. 

1 Introduction. 

We consider the following situation. Two players (or teams), referred to as A and B, play 
a sequence of rallies after each of which either A or B is declared winner. Every rally is 
initiated by a server — the other player is then called the receiver — and a point is scored 
(by the winner) after each rally. The players continue competing until a maic/i-winner is 
declared, with a match being composed of several sets according to the (m, n, G7)-scoring 
system, which we define as follows. 

The (m,n,G) -scoring system: (i) A set consists of a sequence of independent 
rallies; the winner of a set is the first player to score n points or, in case of a 
tie at n — 1, the first player to create a difference of two points after the tie. 
(ii) A match consists of a sequence of independent sets; the winner of a match 
is the first player to win G sets, (iii) The server in the first rally of the first 
set is determined by flipping a coin; if G > 2, the right to serve in the first 
rally of each subsequent set alternates between the two opponents, (iv) Within 
a set, the right to serve changes between the opponents after each sequence of 
m consecutive rallies by a server until either a set-winner is declared or a tie is 
reached at (n — l,n — 1). (v) After a tie at (n — l,n — 1), the right to serve 
alternates after each rally. 
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To the best of our knowledge, table tennis is the only sport currently using the (m, n, G)- 
scoring system, although speed badminton (a.k.a. speedminton or, before 2001, shuttleball) 
can nevertheless be shown to fit, up to a minor modification, within the above scoring system 
for m = 3, n = 16 and G = 2 (the change concerns the right to serve in the first rally of each 
subsequent set, as here the player who has lost the previous set serves first). Until 2001, 
table tennis was played according to the (5, 21, GQ-scoring system, with G = 2 or 3. In 2001 
the International Table Tennis Federation (ITTF) decided to switch to the (2,11, G)-scoring 
system, with G either 3 or 4. 

In this article we regard table tennis as a succession of identical and independent random 
experiments (the rallies) and analyze the properties of the random processes (score change, 
game duration, etc.) induced by the scoring system. As in most of the (mathematical) 
literature on this kind of sport we will restrict our attention to two well-known models. The 
first is the so-called server model in which it is assumed that rally-outcomes are mutually 
independent and are, conditionally upon the server, identically distributed. Denoting by 
p a (resp., pb) the probability that player A (resp., player B) wins a rally he/she initiates, 
the game is then entirely governed by the bivariate parameter (p a ,Pb) £ (0, 1) x (0, 1). The 
second model under investigation is the simpler no-server model, in which it is assumed that 
rallies are won with probabilities that do not depend on the server; the latter model is thus 
a particular case of the former, with parameters p a = 1 — pb = p G (0, 1). For a discussion 
on the validity of these models we refer the reader to e.g. Klaassen and Magnus (2001) 
where is is shown, within the framework of the game of tennis, that i.i.d. models provide 
a good approximation to real games. We have not investigated whether their conclusions 
can be transposed to our framework, and reserve such empirical considerations for later 
publications. 

While an important literature has been devoted to sports such as tennis (see, e.g., Hsi 
and Burich 1971, George 1973, Carter and Crews 1974, Riddle 1988 or Klaassen and Mag- 
nus 2003) or badminton (see, e.g., Phillips 1978, Simmons 1989 or the recent contributions by 
Percy 2009 and Paindaveine and Swan 2011), the corresponding literature on table tennis is, 
to say the least, scant. Indeed, to the best of our knowledge, only one paper (Schulman and 
Hamdan 1977) addresses this issue and proposes a solution only for (m,n,G) = (5,21,2). 
This lack of interest perhaps originates in the apparent simplicity of the stochastics under- 
lying the game of table tennis. Indeed, whereas in sports such as badminton or tennis the 
number of consecutive serves by any player is random, the right to serve in table tennis 
changes according to a deterministic rule, and thus rather simple (albeit delicate) com- 
binatoric arguments allow to obtain all the corresponding probability distributions - see 
Section [2} Moreover the formulae we obtain are - see Appendix A.l - rather lengthy and 
cumbersome. 

This dismissive view of the problem is, in our opinion, wrong. Consequently, we tackle 
in this paper all possible questions of interest related to sports based on the (m, n, G)- 
scoring system, and solve them in full generality. The formulae we obtain are based on 
rally-level combinatorial arguments and allow us to derive the probability distribution (and 
hence the expectation and variance) of the number of rallies necessary to achieve any given 
succession of rally-outcomes. By doing so, we extend the previous contribution on the 
subject (Schulman and Hamdan 1977) where only the special case of the (5, 21, 2)-scoring 
system and only set-winning probabilities are computed. 
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The outline of the paper is as follows. In Section [2] we fix the notations, obtain the main 
theoretical results and derive the distributions of the score and game-duration. In Section [3] 
we investigate the dependence of the quantities obtained in Section [2] on the parameters 
(m,n,G), with a particular attention given to the effects of the ITTF's rule change (see 
above) on the game-winning probabilities and durations. Some final comments and an 
outlook on future research are stated in Section |4j while an appendix collects the full-length 
formulae and their proofs. 



2 Distribution of the score, set-winning probabilities and dis- 
tribution of the number of rallies in a single set. 

In this section we fix the notations and obtain the fundamental probabilities associated with 



the scoring process within a single set (see Lemma 2.2 below). We use these to derive the 
distribution of the scores, the set-winning probabilities as well as the distribution of the 
number of rallies needed to complete a set. 



2.1 Notations 



Here and throughout we denote scores by couples of integers where the first entry (resp., the 
second entry) stands for the number of points scored by player A (resp., by player B). We 
will reserve the use of the notation (n, k) (resp., (k, n)) to indicate the final score in a set 
won by A (resp., by B) without a tie. General intermediate scores will be denoted (a,/3). 

We call A-set (resp., -B-set) a set in which player A (resp., player B) is the first server. 
Note the symmetric roles played by A and B, which will often allow us to state our results 
in terms of ^4-sets only. For C\,Ci £ {A,B}, we denote by p c 2 the probability that C2 
wins a Ci-set. Now, similarly to Schulman and Hamdan (1977), it can be shown that, for m 
dividing n — 1, p A = p^ (resp., p A = i.e. the probability for each player to win a single 
set is not affected by the choice of the first server in the set. Consequently, the probability of 
player A (resp., player B) winning a match can be obtained in a straightforward manner by 
simple conditioning. Since the values of m chosen by international federations always satisfy 
the aforementioned divisibility constraint, we can (and will) restrict most of our attention 
on the outcome of a single A-sei. 

For C E {A, B}, we denote by E A the event associated with all sequences of a + j3 
rallies in an A-set that gives rise to a (resp., 0) points scored by player A (resp., by player B) 
and sees player C score the last point. The latter condition entails that we necessarily assume 
a > (resp., (3 > 0) when C = A (resp., when C = B). We denote the corresponding 
probability by 

p<*/' C = P[Ef> C ]. (2.1) 

With these notations, an ^4-set is won by A (resp., by B) on the score (n, k) (resp., 
(k,n)) with probability p n A ' (resp., p A ,n,B ), for k < n — 2. Obviously these quantities do 
not suffice to compute the probability p A that C wins a set initiated by A, since we still 
need to account for what happens in case of a tie at n — 1, an event which occurs with 

t_ t_-i-j- n—l,n—l,A . n—l,n—l,B t-i , n • • , j ,1 , tie, A / tie,_B\ 

probability p A + p A ■ r or this we introduce the notation p A (resp., p A ) 

to represent the probability that A (resp., B) scores 2 more consecutive points than his/her 
opponent after the tie; this quantity will have to be computed differently than those defined 
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above, since the rules governing the game after a tie are not the same as those which were 
applicable before this event. With these notations we get 



n-2 



A n.k.A , i n—l,n—l,A . n—l,n—l,B\ tie, A 

pa = 2^Pa +(pa +Pa )Pa > 



and 



k=0 



n-2 



B ST^- k.n.B . / n—l,n—l,A , n—l,n—l,B\ tie.B 

pa = 2^Pa +(pa +PA )pa ■ 



(2.2) 



(2.3) 



fc=0 



The corresponding quantities for S-sets are defined by switching the roles of the two players 
in these definitions. 



Determining (2.1) and computing p"^' C is the key to our understanding of the problem. 
We derive a general formula for these quantities in the following section. 



2.2 Distribution of the scores in a single set. 

In order to give the reader a feeling for what kind of mechanics are at play, we start with 
the (very simple) no-server model. Here every rally is won by player A (resp., by player B) 
with probability p (resp., 1—p), irrespective of the server. The process therefore reduces to 
a succession of independent Bernoulli trials with success-probability p, yielding 

a,B,A /a + ^-A Q g a a B f a + (3 - l\ g 

Pa =[ a _ l )P I 1 -P) andp/^ =1 ^ )p(l-p) p ( 2A ) 

for all a,/3gN. All that remains is to determine the probabilities p^' A and p^ ,B ■ Clearly, 
the number of rallies played after a tie until either player wins the set has to be even. Let 
I G No. In order to have It rallies after the tie, it suffices that A and B win alternatively 
during the first 1(1 — 1) rallies, and that one of the two players scores 2 successive points 
thereafter. This entails that the probability that A (resp., B) wins the set after 11 points 
is given by p 2 (1p(\ — p)Y~ l (resp., by (1 — p) 2 (1p(l — p))^" 1 ). Hence the probability of A 
(resp., of B) winning the set after a tie at n — 1 is given by 

tie,A P 2 i tie,B (1 ~ P) 2 ,„ c v 

Pa = i-ip(i-p) andPA = i-mi-pY (2 - 5) 



Thus, from (2.2), (2.3) and (2.4), simple summation yields the game-winning probabilities 
p\ and p^- Note that these quantities do not depend on the parameter m. 

The case of the server model is trickier. The scoring process of A (resp., of B) is a 
succession of m independent Bernoulli trials with success-probability p a (resp., 1 — p a ), 
followed by m independent trials with success-probability 1 — p& (resp., followed again 
by m Bernoulli trials identical to the first and so on, until the end of the set (or until 
the tie is reached). We aim to derive the probability of the event . We know that 

this event is equivalent to a succession of independent (albeit not identically distributed) 
binomial experiments with parameters (m,p a ) or (m, 1— p&). Hence all related distributions 
will be relatively straightforward to derive as soon as we are able to compute, for any given 
score, the number K, say, of complete service sequences (of m rallies) and the number 
R (< m), say, of remaining serves (by either A or B) during the last service sequence. This 
is performed in the following lemma (which we state without proof). 
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Lemma 2.1 Let < a, ft < n — 1. Define the integers K and R as the unique solutions of 
a+ft = Km+R with R < m—1. Then the event E°£ can be decomposed into k\ := \K/2] 
service sequences by A and Jt2 '■= \Kj1\ service sequences by B, ended by R serves of A 
(resp., of B) when K is even (resp., when K is odd). 

Let us now consider the event E^ ,C for C G {A, B}. From the nature of the game it is 
clear that, given the score (a, ft), the only random quantity left is the number of points A (or, 
equivalently, B) has obtained on his/her own serve. Hence it is natural to define the events 
E% p ' c (j) C E a / fi for which the scoring sequence occurs with A scoring exactly j points on 
his/her serve. Denoting by (j) the corresponding probability, these quantities can then 

be derived by exploring the essentially binomial nature of the experiment, as summarized 
in the following lemma. 



Lemma 2.2 Let a, ft G N, and define K,R as in Lemma 2.1 Also, for C G {A,B}, define 
S][ as the indicator of the equality between A and C. Then, (i) if R > and K is even (or 
R = and K is odd), 

where j G {max(5^, a — f^]m), . . . , min(a, [^l m + R — ^ + $a)}> (H) if R > ® and K is 
odd (or R = and K is even), 

where j G {max(0, a — [^Jw — R + 1 — 6^), ■ ■ ■ , min(a — 5%, l^]^)}- 

Note that the above condensed formulae are easier to read if spelt out explicitly in 



each of the 8 possible cases. We therefore provide a complete version of Lemma 2.2 in the 
Appendix, accompanied by a formal proof. Summing up it is now a simple matter to obtain 
the final formulae - at least for game scores that do not involve a tie. 

Theorem 2.1 Fix a, ft G N, and letp a /' C := P[E^'°], where E a /' C := \JjE^' C (j) with 

oo oo 

C G {A,B}. Then p^ 13, = p°^ (j) andp^' B = ^^P^'^C?), where the probabilities 

j=0 j=0 

Pa U) are defined in Lemma 



2.2 



When the parameters are chosen so as to satisfy the constraint p a = 1 — p^ = p, the 
corresponding results concur with those obtained for the no-server model. In all other cases, 
these results do not allow for an agreeable form so we dispense with their explicit expression. 
The behavior of these quantities in terms of the ruling parameters of the game is illustrated 
(both in the old and new scoring systems) in Figure [3| Finally regarding the winning 



probabilities in case of a tie, the same reasoning as for (2.5) reveals that the probability 
that A (resp., B) wins the set after U points is given by p a (l—pb)((l —p a )(l—pb) +PaPbY~ 1 
(resp., by (1 — p a )Pb((^ ~Pa){^ —Pb) +PaPb)^ 1 )- The probability of A (resp., of B) winning 
the set after a tie at n — 1 readily follows and thus, combining Theorem |2.1| and equations 



(2.2) and (2.3) yields p\ and p^, as desired. Because these probabilities are particularly 
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cumbersome to spell out explicitly we rather choose to illustrate their behavior (both in 
the old and new scoring systems) numerically and graphically (see Table [I] and the different 
figures below). 

2.3 Distribution of the number of rallies in a single set. 

The results established in the previous section allow us now to investigate the distribution 
of the number of rallies, D say, needed to end a single set. Let S stand for the random 
variable recording the first server of a set, and, for C 6 {A,B} and E an event, denote by 
Pc[-E] := P[E \ S = C] the probability of E conditional upon who serves first in a set. The 
main object of interest in this section is the mapping d i— > Pa[D = d] for rfgN. 

Obviously, because a point is scored on every rally, the number of rallies needed to 
reach a score (a,f3) is equal to a + (3. Hence Pa[D = d] = for all d < n — 1 and 
Pa[D = d] = p r ^ d ~ n,A -j- p d ~ n > n ' B f or all n < d < 2(n — 1). Finally, for d > 2n — 1, such 
length can only be achieved through the occurrence of a tie so that Pa[D = d] = if 
d = 2n — 1 and otherwise Pa[D = d] = (p A ~ 1,n ~ 1, + p^ -1 ' n-1 ' )p t le, where the latter 
quantity gives the probability that there are exactly d — 2(n — 1) rallies after the tie. All 
that remains is therefore to compute p t Q- For this first note that p t Q = if d — 2{n — 1) is 
odd. Next it is a simple matter of applying the same arguments as in Section |2.2| to obtain 
p d t Q = (p a (l - p b ) + (1 - Pa)Pb){{l - Pa){l ~ Pb) + PaPbY' 1 with 21 = d - 2{n - 1). The 
distribution of D (as well as its moments) readily follows, as illustrated in Table [TJ Figure [T] 
and Figure [4} 

The evolution of the expectation and variance of the number of rallies D in an ^4-set 
unconditional on the winner of that set are displayed in Figure [l] (Note that the effect of 
a variation of n in one set played according to the (n, m, G)-rule will be a straightforward 
rescaling. Plots are therefore not provided.) The expectation curves are quite smooth and 
unimodal around p a = Pb, while the variance curves are also smooth but exhibit a slightly 
bimodal nature around p a = Pb- These findings are in accordance with the intuition that 
players of approximately the same strength play longer. Moreover, when the strengths are 
only slightly different, the variance becomes biggest, as in that case there may be both tight 
and uneven sets. Finally note that when pb — > or pb — > 1 the expected duration is not 
equal to 21 as one might think at first sight but is rather around 25. This evidently follows 
from the fact that p a is confined between .3 and .7, and hence when player A serves the 
outcomes remain random. 

3 Comparing scoring systems in table tennis. 

In this section, we discuss the influence of the parameters (m, n, G) in terms of score dis- 
tributions, durations and match-winning probabilities. Our main interest is to discuss the 
effects of the switch from the "old" (5, 21, GQ-scoring system to the "new" (2, 11, G)-scoring 
system. 

We first stress how little influence the choice of m has on the different distributions, 
as long as m is chosen so as to satisfy the above described divisibility constraints. This is 
illustrated in Figure [2j where the ratio between the expected durations (Figure [2] a) and 
standard deviations (Figure [2] b) under the (5,11,1) and the (2,11,1) rules are reported. 
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Figure 1: Evolution of expectation and variance 
Both subfigures refer to an ^4-set played under the (5, 21, l)-scoring system. Subfigure (a) shows the evolution 
of the expectation and subfigure (b) shows the evolution of the variance of the duration D unconditional on 
the winner of the set. 
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Figure 2: Ratio of expectations and standard deviations, as a function of p&, between new 
and old scoring systems 

Both plots refer to an A-set played with n = 11 and G = 1 for different values of p a , as a function of Pb- 
Plot (a) shows the ratio between the expected durations under the (11, 2, 1) rule and that under the (11, 5, 1) 
rule; Plot (b) reports the ratio between the standard deviations. 



Aside from the fact that the different curves cross the horizontal line y = 1 at each couple 
(p a ,Pb) such that p a + pb = 1, we draw the reader's attention to how little variation these 
quantities endure. This encourages to disregard the role of m in the future, and concentrate 
on the dependence on the parameters n, G and (p a ,Pb)- 

Table [T] shows the probability that A wins a set, in which he/she serves first, both for the 
old and new scoring system, as well as the expected duration of such a set in both scoring 
systems. The two first parts in Table fl] list the values of p\ for all possible combinations 
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Table 1: Probability and Expected Duration 





1 
VJ. 1 


0.2 


0.3 


0.4 


Pb 
0.5 


0.6 


0.7 


0.8 


n q 
u.y 


Pa 




Probability that A 


wins a set when A serves first with m = 5 and n = 21 




U. 1 


U.OUUU 


0.1657 


0.0404 


0.0072 


0.0008 











U 


n o 
v.z 


U.oo4Z 


0.5000 


0.2190 


0.0702 


0.0159 


0.0023 


0.0002 





n 
U 


U.o 


u.yoyo 


0.7809 


0.5000 


0.2430 


0.0863 


0.0209 


0.0030 


0.0001 


n 
U 


n a 

U.4 


n ooofi 

u.yyzo 


0.9297 


0.7569 


0.5000 


0.2530 


0.0914 


0.0209 


0.0023 


n 
u 


U.O 


u.yyyi 


0.9840 


0.9136 


0.7469 


0.5000 


0.2530 


0.0863 


0.0159 


U.UUUo 


U.O 


u.yyyy 


0.9976 


0.9790 


0.9085 


0.7469 


0.5000 


0.2430 


0.0702 


n novo 


U. / 


u.yyyy 


0.9998 


0.9969 


0.9790 


0.9136 


0.7569 


0.5000 


0.2190 


U.U4U4 


0.8 


I 


0.9999 


0.9998 


0.9976 


0.9840 


0.9297 


0.7809 


0.5000 


0.0166 


n q 
u.y 




1 


0.9999 


0.9999 


0.9991 


0.9928 


0.9595 


0.8342 


u.uuuu 


Pa 




Probability that A 


wins a set when A serves first with m = 2 and n = 11 




U. 1 


U.OUUU 


0.2300 


0.0951 


0.0343 


0.0103 


0.0024 


0.0003 





U 




u. /oyy 


0.5000 


0.2819 


0.1376 


0.0568 


0.0189 


0.0046 


0.0006 


U 


U.o 


n OH/1 Q 

u.yu4o 


0.7180 


0.5000 


0.3035 


0.1576 


0.0673 


0.0219 


0.0046 


n nnnQ 
U.UUUo 


U.4 


u.yooo 


0.8623 


0.6964 


0.5000 


0.3121 


0.1635 


0.0673 


0.0189 


U.UUZ4 


U.O 


u.yoyo 


0.9431 


0.8423 


0.6878 


0.5000 


0.3121 


0.1576 


0.0568 


U.UlUo 


U.O 


n nri7c; 

u.yy / o 


0.9810 


0.9326 


0.8364 


0.6878 


0.5000 


0.3035 


0.1376 


U.Uo4o 


U. / 


n Ann/; 

U.yyyo 


0.9953 


0.9780 


0.9326 


0.8423 


0.6964 


0.5000 


0.2819 


0.9513 


0.8 


QQQQ 


0.9993 


0.9953 


0.9810 


0.9431 


0.8623 


0.7180 


0.5000 


2300 


0.9 


0.9999 


0.9999 


0.9996 


0.9975 


0.9896 


0.9656 


0.9048 


0.7699 


0.5000 


Pa 




Expected duration of a set when A serves 


first with 


m = 5 and 


n = 21 




0.1 


40.1616 


36.5982 


33.7171 


31.3842 


29.3013 


27.4362 


25.8432 


24.5067 


23.3333 


0.2 


38.6311 


38.2747 


36.3666 


34.0094 


31.7268 


29.6526 


27.8216 


26.2500 


24.9032 


0.3 


36.0125 


37.5782 


37.6279 


36.3163 


34.2761 


32.0696 


29.9949 


28.1681 


26.6186 


0.4 


33.1567 


35.4021 


36.9720 


37.3272 


36.3937 


34.5748 


32.4159 


30.3135 


28.4663 


0.5 


30.5940 


32.7743 


34.9382 


36.6022 


37.2359 


36.6022 


34.9382 


32.7743 


30.5940 


0.6 


28.4663 


30.3135 


32.4159 


34.5748 


36.3937 


37.3272 


36.9720 


35.4021 


33.1567 


0.7 


26.6186 


28.1681 


29.9949 


32.0696 


34.2761 


36.3163 


37.6279 


37.5782 


36.0125 


0.8 


24.9032 


26.2500 


27.8216 


29.6526 


31.7268 


34.0094 


36.3666 


38.2747 


38.6311 


0.9 


23.3333 


24.5068 


25.8433 


27.4362 


29.3013 


31.3843 


33.7172 


36.5982 


40.1616 


Pa 




Expected duration of a set when A serves 


first with 


m = 2 and 


ra = 11 




0.1 


22.5402 


19.9063 


18.0171 


16.5767 


15.4215 


14.4575 


13.6267 


12.8901 


12.2222 


0.2 


20.4333 


20.0069 


19.0096 


17.8241 


16.6473 


15.5657 


14.6029 


13.7513 


12.9904 


0.3 


18.7138 


19.3163 


19.2465 


18.6612 


17.7620 


16.7301 


15.6955 


14.7315 


13.8635 


0.4 


17.2440 


18.2349 


18.8266 


18.9228 


18.5459 


17.8052 


16.8511 


15.8300 


14.8509 


0.5 


15.9686 


17.0222 


17.9547 


18.5985 


18.8285 


18.5985 


17.9547 


17.0222 


15.9686 


0.6 


14.8509 


15.8300 


16.8511 


17.8052 


18.5459 


18.9228 


18.8266 


18.2349 


17.2440 


0.7 


13.8635 


14.7315 


15.6955 


16.7301 


17.7620 


18.6612 


19.2465 


19.3163 


18.7138 


0.8 


12.9904 


13.7513 


14.6029 


15.5657 


16.6473 


17.8241 


19.0096 


20.0069 


20.4333 


0.9 


12.2222 


12.8901 


13.6267 


14.4575 


15.4215 


16.5767 


18.0171 


19.9063 


22.5402 
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Table 2: Ratio of Expected Durations and Standard Deviations 



T12 / 


n\ 


0.1 


0.2 


0.3 


0.4 


0.5 








Ratio of expectations 




21/11 = 


1.9090 


1.9000 


1.9058 


1.9297 


1.9623 


1.9776 


31/21 = 


1.4761 


1.4670 


1.4697 


1.4797 


1.4963 


1.5052 


41/31 = 


1.3225 


1.3149 


1.3172 


1.3221 


1.3326 


1.3392 


31/11 = 


2.8181 


2.8231 


2.8276 


2.8728 


2.9419 


2.9769 


41/11 = 


3.7272 


3.7482 


3.7527 


3.8151 


3.9260 


3.9867 








Ratio of Standard 


Deviations 




a/21/11 = 


= 1.3816 


1.3437 


1.3037 


1.3426 


1.2975 


1.2356 


v/31/21 = 


= 1.2149 


1.1789 


1.2171 


1.2527 


1.2356 


1.1754 


x/41/31 = 


= 1.1500 


1.1553 


1.1500 


1.1790 


1.1863 


1.1349 


a/31/11 = 


= 1.6787 


1.5490 


1.6080 


1.6819 


1.5944 


1.4524 


a/41/11 = 


= 1.9306 


1.8476 


1.8529 


1.9749 


1.8816 


1.6484 



of couples (p a ,Pb) varying respectively from 0.1 to 0.9 with a step size of 0.1. A simple 
inspection of the probabilities indicates that the stronger one of the two players inevitably 
wins the set, which is a confirmation of the common intuition. The main difference between 
the two scoring systems lies in the fact that on the one hand, when p a > pb (values under 
the diagonal), player A has a higher chance to win the set in the old system than in the 
new one, whereas on the other hand, when p a < pb (values over the diagonal), player A J s 
(small) chances to win the set are higher in the new system. This can be explained as 
follows: the more points are necessary to win the set, the more important the strengths 
p a and pb become, and the "victory against the odds" is therefore less probable in the 
old scoring system than in the new one. We also draw the reader's attention to the fact 
that p^(p a ,Pb) = Pa(1 - Pb, 1 - Pa) and p\{p a -,Pb) + p\{Pb,Pa) = 1 (with p\(x, y) standing 
for the probability that player A wins an A-set under the conditions p a = x and Pb = U, 
x,y £ (0,1)). These phenomena are discussed in Schulman and Hamdan (1977), and will 
not be further elaborated on here. 

The two last parts of Table [T] show the expected duration of a set in which player A 
serves first. The sets with the largest expected durations are those on the diagonal, where 
both players are roughly of similar strength. Clearly the duration of a set is smaller in the 
new scoring sytem than in the older one. More interesting is the fact that the duration 
of a set in the old scoring system is always roughly double that in the new scoring system 
or, more precisely, that we have e^ old /e^ new ~ 21/11. This seemingly linear relationship 
between the average length of a set and the number of points played is easily confirmed 
by further computations, as shown in Table [2] Likewise, from Table [2j we see that the 
same multiplicative factor (under a square root) appears when considering the standard 
deviation of D. The same conclusions hold true when passing from a single set to a full 
match. From this we deduce that in order for the ITTF, for instance, to derive a scoring 
triple (rri2, ri2, G2) which does not change the average length of the game when switching 
from the original triple (mi, ni, Gi), it is advisable to choose the number of sets G2 in such 
a way that G\n\ = G2U2- If, for instance, n\ = 21 and G\ = 3, then the choice of ng = 11 
and G2 = 6 would influence the expected durations less than G2 = 3 or G2 = 4. The choice 
of G2 = 3 or 4 nevertheless ensures a difference that is negligible (see Figures [5] and [6] and 
[7] where different choices of triples are compared in the no-server model) . 

Figure [3] presents, for n = 11 and n = 21 respectively, the score distribution associated 
with (p a ,Pb) = (-7, .5), (.6, .5), (.5, .5), (.4, .5) and (.3, .5). Subplots (a) and (c) contain the 
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(a) 



(b) 




Figure 3: Distribution of the scores with pb = .5 
Subfigures (a) and (b) refer to an A-set played under the (2, 11, l)-scoring system. Subfigure (a): for 
{Pa,Pb) = (.7, .5), (.6, .5), (.5, .5), (.4, .5) and (.3, .5), probabilities p^ k ' A that player A wins the set on the 
score (n,k). Subfigure (b): the corresponding values for victories of B. Subfigures (c) and (d) refer to an A- 
set played under the (5, 21, l)-scoring system. Subfigure (c): for (p a ,p b ) — (.7, .5), (.6, .5), (.5, .5), (.4, .5) and 
(.3, .5), probabilities p r ^ k ' A that player A wins the set on the score (n, k). Subfigure (d): the corresponding 
values for victories of B. The different set-winning probabilities p^ for C £ {A, B} and each couple of values 
(Pa,Pb) are given in each subfigure. 
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probabilities that player A wins an A-set in which the opponent scores a total of k points (for 
instance, in the new scoring system, k = 0, . . . , 9 corresponds to a victory of A on the score 
(11, 0),. . . ,(11, 9), and in case of a tie, for which k > 10, player A wins on scores of the form 
(k+2, k)) and the subplots (b) and (d) give the probability for B winning an A-set where this 
time player A scores k points. It appears that these distributions are highly sensitive to the 
values of (p a ,Pb)', obviously, this extends to the corresponding match-winning probabilities. 
In the (5, 21, G)-scoring system, varies between .09 and .91 and p^ between .91 and .09, 
when, for fixed pb = .5, p a ranges from .3 to .7. For the other scoring system, p\ takes 
values between .16 and .84, while p 1 ^ ranges from .84 to .16. This confirms our previous 
findings that the difference in strengths is more important in the old than in the new system. 
The following three comments hold for both scoring systems: (i) as A (resp., B) is getting 
stronger with respect to B (resp., ^4), his/her winning probability dramatically increases, 
(ii) whatever the value of p a , white- washes seldom occur, and (iii) the score distributions are 
not monotone in p a . Finally note that subfigures (a) and (b), as well as subfigures (c) and 
(d), display the same curves with the colors in reverse order; this is due to the symmetric 
roles played by the two players. 

In Figure [4] we plot, in the setup of an A-set and for (p a ,Pb) = (-5, .5), (.4, .5) and 
(.3, .5) (we omit the couples (.7, .5) and (.6, .5) as the symmetry of the situation implies a 
confounding with the plots for (.3, .5) and (.4, .5), respectively), the unconditional (on the 
winner of the set) probabilities that the number of rallies D equals d £ [0, 60] with respect 
to that value d. The expectations of D in each case are indicated as well. Subfigure (a) 
corresponds to the new scoring system, and subfigure (b) to the old one. In the old scoring 
system the probabilities are more dispersed than in the new scoring system; this is a direct 
consequence of the larger number of points needed to play until either player wins the set. 
For both plots, the peaks that occur after the main peak are due to the fact that a set can 
only end after an even number of rallies in case of a tie. The red curve is the one that 
takes the highest values, which confirms the intuition that two players of the same strength 
take more time to battle out the set and that the scores are quite tight (see also the related 
expected values = 18.83 in the new scoring system and = 37.24 in the old one). 

The dependence of the set-winning probabilities on (p a ,Pb) is of primary importance. In 
what follows, we will investigate this dependence visually by comparing both scoring systems 
in the no-server model (p = p a = 1—pb). The results are illustrated in Figure[5j Figure[5](a) 
shows the probability that player A wins an A-set as a function of his/her strength p. The 
blue curve gives the probability for the new scoring system and the red curve for the old 
scoring system. The plot supports the claim that, for most values of p, the choice of the 
scoring system does not influence the set-winning probabilities. The only slight differences 
appear within [.3, .45] and [.55, .7], where the new scoring system quicker equalizes both 
players, which is in agreement with our previous findings. These observations are further 
supported by Figure [5] (b), where the ratio between the old and the new scoring system is 
plotted as a function of p. This graph reveals that although the probability that player A 
wins an ^4-set is essentially the same for both scoring systems if he/she is the best player 
{PA° ld /Pa^™ ~ 1 for p > .7, with evident notations), for p € [.5, .7] the probability that A 
wins the ^4-set in the old scoring system is slightly higher than in the new one, and when 
p < .5, player A is more likely to win the set in the new system than in the old one. Finally, 
the ratio approaches as p tends to (for p < .2, say). This last remark indicates that the 
old scoring system leaves severely less winning chances to A when he/she is weak than the 
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Figure 4: Distribution of D with = .5 
Both plots refer to an A-set. For (p a ,Pb) ~ (-5, .5), (.4, .5) and (.3, .5) they report the probabilities that the 
number of rallies D needed to finish the set takes value d unconditional on the winner of the set. Plot (a) 
shows the probabilities and expectations for the (2, 11, l)-scoring system and plot (b) for the (5, 21, l)-scoring 
system. 
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Figure 5: Comparison: one set 
As a function of p = p a — 1 — pb (that is, in the no-server model), subfigure (a) contains the probabilities p\ 
(in red) that player A wins an A-set in the (21, 5, l)-scoring system, along with the probabilities p\ (in blue) 
that player A wins an A-set in the (11, 2, l)-scoring system. Subfigures (b) and (c) show the expectation 
and the standard deviation of the number of rallies needed to complete the corresponding set, unconditional 
on the winner. 
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new one; in other words, we see again that the new scoring system appears more balanced. 
The duration plots simply confirm what common-sense already tells us. Indeed, we see in 
Figure[5](c) that the expectation of D is, uniformly in p G (0, 1), smaller for the new scoring 
system than for the old one; this is of course a direct consequence of the dramatic reduction 
of the number of points necessary to win the set. The same conclusion happens to be true for 
the standard deviation of D (the old scoring system varies between 3.26 and 4.29, the new 
scoring system between 2.42 and 3.22), as illustrated in Figure [5] (d). The twin-peak shape 
of both standard error curves is most certainly due to the quasi-equality in rally-winning 
probabilities between both players which leads to both tighter final scores and more uneven 
scores. Finally note that both curves of Figures [5] (c) and (d) are symmetric about p = .5, 
as expected. 

We conduct, in Figure [6| a similar comparison between both scoring systems at match 
level. More precisely, we compare a (5, 21, 3)-scoring system (red curve) with a (2,11,4)- 
scoring system (blue curve). Subfigures (a) and (b) of Figure [6] (to be compared with the 
corresponding subfigures in Figure [5]) indicate that, for any fixed p, the higher number 
of sets needed to achieve victory in the new scoring system ensures that match-winning 
probabilities remain roughly the same before and after the rule change. Subfigures (c) and 
(d), i.e. the expectation and standard deviation plots, convey the same message as for a 
single set, except that here the two curves of the standard deviation cross at some points. 
Note that the standard deviation is in general smaller in the new scoring system, hence the 
change has made the length of the match more predictable. Finally in Figure [7] we show, 
for the sake of illustration, how choosing G n ew and G \^ such that n n ewG n ew ~ n old^old 
ensures - as is intuitively clear - that both the probabilities and the expectations remain 
largely unchanged when switching from (m ^, n old , G ^) to ( ^new; n new> Gnew)- 



4 Final comments. 

This paper provides a complete probabilistic description of games played according to the 
(m, n, G)-scoring system. It complements and extends the previous contribution by Schul- 
man and Hamdan (1977). The formulae provided, although cumbersome, can easily be 
implemented for numerical purposes, and yield some striking illustrations. The results of 
our paper can be useful, in practice, to the sport community (e.g. the ITTF) and to TV 
broadcasting programs, as they allow to analyze any (m, n, G)-scoring system. Our findings 
have allowed us to perform an in-depth comparison of the old and new scoring system in 
table tennis. If the aims behind the system change were (i) a better control of the length of a 
match and (ii) an increase in the potential number of crucial points without influencing the 
relative strengths of the players too much (i.e. without changing the winning probabilities), 
then our results confirm that these goals are achieved. 

We also mention how our results also allow for constructing elegant estimation procedures 
of the respective strengths of the players. Obviously disposing of the full formulas permits 
to work out Maximum Likelihood Estimators for p a , pt>. These will necessarily be obtained 



by numerical maximization on basis of the formulae in Lemma 2.2 There is, however, a 
more elegant (and efficient) way to perform such estimation, which only requires retaining 
one more information from a given encounter between two players, namely the number 
of points scored by player A (resp., player B) on his/her own serve and on the serve of 
his/her opponent. Indeed with this information one can show that the maximum likelihood 
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Figure 6: Comparison: a match 
As a function of p = p a — 1 — Pb (that is, in the no-server model) , subfigure (a) contains the probabilities 
Pa (in red) that player A wins an A-match in the (21, 5, 3)-scoring system, along with the probabilities 
p\ (in blue) that player A wins an A-match in the (11, 2, 4)-scoring system. Subfigures (b) and (c) show 
the expectation and the standard deviation of the number of rallies needed to complete the corresponding 
match, unconditional on the winner. 
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Figure 7: Comparison: a match 
As a function of p = p a — 1 — Pb (that is, in the no-server model) , subfigure (a) contains the probabilities 
Pa (in red) that player A wins an A-match in the (21, 5, 4)-scoring system, along with the probabilities 
p\ (in blue) that player A wins an A-match in the (11, 2, 8)-scoring system. Subfigures (b) and (c) show 
the expectation and the standard deviation of the number of rallies needed to complete the corresponding 
match, unconditional on the winner. 
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estimator for p a , say, is given by the ratio of the number of points won by A on his/her 
serve and the total number of his/her serves. Based on Bradley- Terry paired comparison 
methods, these estimators will allow for constructing interesting new ranking methods within 
round-robin tournaments. This will be the subject of a future publication. 



Finally, since our results of Lemma 2.2 are valid for general intermediate scores (a,/3), 
an extension to the case where the initial score differs from (0, 0) is readily implemented. 
This naturally paves the way to a more dynamic analysis of a match, a la Klaassen and 
Magnus (2003), in which the winning probabilities and duration are sequentially estimated 
throughout the course of a given match. 
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A Appendix. 

A.l Complete formulae. 

Lemma A.l Fix m G No and a, (3 G N, and define K, R G N via the Euclidian division 
a + (3 = Km + R. Then we have 

1. if R = and K is even, then 



where x £ {max(0, a — |m), . . . , min(a — 1, |-m)}, and 



where x G {max(0, a — + 1), . . . , min(a, ^m)}. 
2. if R = and K is odd, then 

p a /' A (-) = C f]m ; 1 )p^-p a ]m - x ( [flm )(i- Pb r 

\ x — 1 J \a — x J 

where x G {max(l, a — \_^\m), . . . , min(a, |~y]7n)}, and 

where x G {max(0, a — IyI 171 )} ■ ■ ■ > rain (a, [^l 771 — 1)}. 



l^-\m—a+x 
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3. if R > and K is even, then 



where x G {max(l, a — ym), . . . , min(a, 4f m + R)}> an d 

where x G {max(0, a — Ifm), . . . , min(a, 4fm + -R — 1)}. 
^. if R > and K is odd, then 



where x G {max(0, a — [^J — -R), . . . , min(a — 1, [^l 771 -)}; an d 



where x G {max(0, a — LyJ" 7 — -R + 1), . . . , min(a, fy]???,)}. 
A. 2 Proof of the complete formulae. 

Since all the formulae follow a very similar pattern, we only provide formal proofs for two 
of the eight expressions, namely for p 3 ^' {x) when R = and K is even and for p°^' B (x) 
when R > and K is odd. 

Suppose a and f3 are such that a + j3 = Km with K even, and player A scores the last 
point. In this setting, the last point is served by player B, hence only the outcome of 
serves of player B are random, while all the ym serves of player A are random. Thanks 
to the independence of the rallies, we may consider the serves of player A and player B 
separately. Moreover, as explained in Section [2j each sequence of serves corresponds to 
a binomial distribution, with parameters {^-m,p a ) for player A serving and parameters 
(4f m — 1) 1 — Pb) fo r player B serving, where we adopt each time the point of view of 
player A. Since player A scores x points on his/her own serve, he/she necessarily wins a — x 
rallies initiated by player B. The last point being considered apart, we are left with a — x — 1 
successes for the binomial distribution with parameters (^m — 1, 1 — pt>). Combining all 
these facts yields the announced formula, and it remains to establish the domain of the 
possible values of x. 

As player B serves exactly times, it is clear that, whenever a > ^m, necessarily 
player A has to score at least a — ym points on his/her own serve. Otherwise, he/she may 
very well score only on player B's serve. These two observations readily yield the lower 
bound for x. Regarding the upper bound, as player A scores at least one point not served 
by himself /herself, he/she cannot score more than a — 1 points on his/her own serve. This 
directly yields the upper bound, as player A is limited to serves. 
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Suppose now that a + (3 = Km + R with K odd and R > 0, and player B scores the last 
point. In this case, player B also serves the last point; would R be zero, then the oddness 
of K would give the last serve to player A. With this said, player A serves exactly ]m 
points, while player B initiates |_T-I m + R exchanges. The formula for p a A B (x) and the 
corresponding lower and upper bound for x follow along the same lines as for p°^' A {x), 
which concludes the proof. □ 
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